TTGTAACAGA AAATTAAAAT ATACTCCACT CAAGGGAATT CTGTACTTTG CCCTTTTGGT -9 9 
A^.AGTCTCAT TTACATTTCT AAACCTTTCT TAAGAAAATC GAATTTCCTT TGATCTCTCT -3 9 

-IMTS CH I 
rCTGAATTGC AGAAATCAGA TAAAAACTAC TTGGTGAA ATG ACT TCT T6T CAC ATT 18 
AEEHIQKVAIFGGTHG 
GCT GAA GAA CAT ATA CAA AAG GTT GCT ATC TTT GGA GGA ACC" CAT GGG 66 



23 


N 


E 


L 


T 










V 







W 


L 


E 


N 


G 






AAT 


GAG 


CTA 


ACC 


GGA 


GTA 


TTT 


CTG 




AAr 


AT 


TGG 


CTA 


GAG 


AAT 


GGC 


114 


39 


A 


E 


I 


Q 


R 


T 




L 














T 


N 






GCT 


GAG 


ATT 


CAG 


AGA 


ACA 


GGG 


CTG 






AAA 

AAA 


CCA 


TTT 


ATT 


ACT 


AAC 


162 


55 


p 


R 




V 


f( 












J" 






D 


L 


N 




CCC 


AGA 


GCA 


GTG 


AAG 


AAG 


rpprr. 




AGA 


TAT 


ATT 


GAC 


TGT 


GAC 


CTG 


AAT 


210 


71 


R 


I 


F 








N 


L 








M 




E 


D 


L 






CGC 


ATT 


TTT 


GAC 


CTT 


GAA 


AAT 


CTT 






AAA 


ATG 


TCA 


GAA 


GAT 


TTG 


2 58 


87 


p 


Y 


E 


V 


R 


AG^ 




Q 












F 


G 


P 






CCA 


TAT 


GAA 










CAA 


AA 


ATA 


AAT 


CAT 


TTA 


TTT 


GGT 


CCA 


3 06 


103 


K 


D 






















L 


H 


N 


T 






AAA 


GAC 


AGT 


GAA 


GAT 


TCC 


TAT 


GAC 


ATT 


ATT 


TTT 


GAC 


CTT 


CAC 


AAC 


ACC 


354 


119 


T 


s 


N 


M 


Q 




T 












S 


R 


N 


N 






ACC 


TCT 


AAC 


ATG 




TGC 


ACT 






CTT 


GAG 


GAT 


TCC 


AGG 


AAT 


AAC 


402 


13=5 


F 


jj 




Q 






/l 








T 


S 


L 


A 


P 


L 








TTA 


ATT 


CAG 




mmm 


CAT 


TAC 


ATT 


AAG 


ACT 


TCT 


CTG 


GCT 


CCA 


CTA 


450 


15=i 


p 




















S 


L 


K 


Y 


A 


T 






CCC 


TGC 


rpap 


GTT 


TAT 


CTG 


ATT 


GAG 


CAT 


CCT 


TCC 


CTC 


AAA 


TAT 


GCG 


ACC 


498 


167 


T 














P 




G 


I 


E 


V 


G 


P 


Q 








CGT 


T-nr" 


ATA 


GCC 


AAG 


TAT 


CCT 


GTG 


GGT 


ATA 


GAA 


GTT 


GGT 


CCT 


CAG 


546 


18 i 


p 


Q 














I 


L 


D 


Q 


M 


R 


K 


M 






CCT 


CAA 


GGG 


CTT 


PTT 


AGA 


GCT 


GAi: 


ATC 


TTG 


GAT 


CAA 


ATG 


AGA 


AAA 


ATG 


594 


199 


I 








T 






"I 








N 


E 


G 


K 


E 






ATT 


AAA 


CAT 




PTT 


GAT 


TT^ 


ATA 


CAT 


CAT 


TTC 


AAT 


GAA 


GGA 


AAA 


GAA 


642 


2 15 










A 










K 


I 


I 


E 


K 


V 


D 






TTT 


CCT 


CCC 


TGC 


GCC 


ATT 


GAG 


GTC 


TAT 


AAA 


ATT 


ATA 


GAG 


AAA 


GTT 


GAT 


690 


231 


Y 


P 


R 


D 


E 


N 


G 


E 


I 


A 


A 


I 


I 


H 


P 


N 






TAG 


CCC 


CGG 


GAT 


GAA 


AAT 


GGA 


GAA 


ATT 


GCT 


GCT 


ATC 


ATC 


CAT 


CCT 


AAT 


738 


247 


L 


Q 


D 


Q 


D 


W 


K 


P 


L 


H 


P 


G 


D 


P 


M 


F 






CTG 


CAG 


GAT 


CAA 


GAC 


TGG 


AAA 


CCA 


CTG 


CAT 


CCT 


GGG 


GAT 


CCC 


ATG 


TTT 


786 


263 


L 


T 


L 


D 


G 


K 


T 


I 


P 


L 


G 


G 


D 


C 


T 


V 






TTA 


ACT 


CTT 


GAT 


GGG 


AAG 


ACG 


ATC 


CCA 


CTG 


GGC 


GGA 


GAC 


TGT 


ACC 


GTG 


834 


279 


Y 


P 


V 


F 


V 


N 


E 


A 


A 


Y 


Y 


E 


K 


K 


E 


A 






TAC 


CCC 


GTG 


TTT 


GTG 


AAT 


GAG 


GCC 


GCA 


TAT 


TAC 


GAA 


AAG 


AAA 


GAA 


GCT 


882 


295 


F 


A 


K 


T 


T 


K 


L 


T 


L 


N 


A 


K 


S 


I 


R 


C 






TTT 


GCA 


AAG 


ACA 


ACT 


AAA 


CTA 


ACG 


CTC 


AAT 


GCA 


AAA 


AGT 


ATT 


CGC 


TGC 


930 


311 


C 


L 


H 



























TGT TTA CAT TAG AA ATCACTTCCA GCTTACATCT TACACGGTGT CTTACAAATT 9 84 
CTGCTAGTCT GTAAGCTCCT TAAGAGTAGG GTTGTGCCTT ATTCAACTGC ATACATAGCT 1044 
CCTAGCACAG TGCCTTATTC GGTAGGCATC TAAGCAAATT TCTTAAATTA ATTAATATAT 1104 
CTTTAAAGAT ATCATATTTT ATGTATGTAG CTTATTCAAA GAAGTGTTTC CTATTTCTAT 1164 
ATAGTTTATT ATACATGATA CTTGGGTAGC TCAACATTCT TAATAAACAG CCTTTGTATT 12 3 4 
CAGAA'TA'TAA AATTGAAATA GATATATATA AAGTTAAAAA XAAAXAAAAA AAA 12 8 7 



Fig- 1 



lOv 20v 30v 40v 50v 

HLASP MTSCHIAEEHIQKVAIFG<3^DHGNELTGVFLVKHWLENGAEIQRTGLEVKPF 
MTSCH: AE: . I : KVAIFGGTHGNELTGVFLVKHWLEN : : EIQRTGLEVKPF 
BASPCDNA MTSCHVAEDPIKKVAIFGCslH'GNELTGVFLVKHWLENSTEIQRTGLEVKPF 
10-" 20'" ■ 30" 40" 50" 

60v 70V 80v 90v IGOv 

HLASP itnpravkkctryidcdlnrifdlenlgkkkSedlpyevrraqeinhlfgp 

ITNPRAVKKCTRYIDCDLNR: FD ENLGKK. SEDLPYEVRRAQEINHLFGP 

BASPCDNA itnpravkkctryidcdlnrvfdpenlgkkkSedlpyevrraqeinhlfgp 

60" 70" 80" 90" 100" 

llOv 120V 130V 140v 150V 

HLASP kcSeesS ydiifdlhn*ttsnmgctliledsrnnfliqmfh Y iktS lap L P C Y 
C3 KDSEDSYDIIFDLHN*TTSNMGCTLILEDSRN: fliqmfhyiktslaplpcy 

B^PCDNA kc^ecSydiifdlhn'ttsnmgctliledsrndfliqmfhyiktSlaplpcy 

i 110" 120" 130" 140" 150" 

m 160V 170V ISOv 190v 200V 

HLAS P VYLI EHP SLK YATTRS I AKYPVGI EVGPQPQGVLRAD I LDQMRKM I KH A LD 
VYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMI : HALD 
bSIpcdna VYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIQHALD 
160" 170" 180" 190" 200" 

210v 220v 230v 240v 250v 

HLASP FIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQDWKPL 
FIH:FNEGKEFPPCAIEVYKI: KVDYPR:E:GEI:AIIHP:LQDQDWKPL 
B^PCDNA FIHNFNEGKEFPPCAIEVYKIMRKVDYPRNESGEISAIIHPKLQDQDWKPL 
•f 210" 220" 230" 240" 250" 

I 260v 270v 280v 290v 300v 

HM.SP HPGDPMFLS'LDGKTIPLGGDG'i3?YPVFSWS 

HP . DP : FLTLDGKTIPLGGD TVYPVFVNEAAYYEKKEAFAKTTKLTLNA : 
B ASP CDNA HP EDP VFLt'LDGKTIPLGGDQTV^YPVFyNEAA^YYEKKE AFAKTTK LT LNAN 
260" 270" " 'S^O" ""^"=-"2'^0" 300" 

310v 

HLASP SIRCCLH 

SIR-.LH 
BASPCDNA SIRSSLH 

310" 



Fig. 2 



Fig. 3 
Kaul et al. 



-{. 



ATG 



TAG 



Fig. A 

Kaul et al. 




Fig. 5 
Kaul et al. 




Fig. 6 
Kaul e t al . 



MAPSEQ V5 . 33 -HASP. SEQ(1, 1277) Reading frames: 1 Enzyme file ALL.ENZ LinPage 1 



EAM 


M 


E 


M 


N 


DSNDSBB 


ASA 


B 


C 


N 


L 


STCSESS 


MPE 


0 


5 . 


L 


'a 


AYOACAA 


1E3 


2 


7 


1 


4 


lllllJJ 



/ ///// 

ATGACTTCTTGTCACATTGCTGAAGAACATATACAAAAGGTTGCTATCTTTGGAGGAACC 
. + _ + , + . + . + . + 60 




TACTGAAGAACAGTGTAACGACTTCTTGTATATGTTTTCCAACGATAGAAACCTCCTTGG 
mtschiaeehiqkvaifggt 



BBH 
SOP 
AAA 
W7 2 
// 



y 

TSM 
RPS 
UOE 
911 

■A'AA 



RM 
MA 
AE 



H HHD 
I HAD 
N AEE 
P 121 
I 



11 
/ 



CATGGGAATGAGCTAACCGGAGTATTTCTGGTTAAGCATTGGCTAGAGAATGGCGCTGAG 

_ , ^ , -1- _ 1 . + . + . V 

GTACCCTTACTCGATTCGCCTCATAAAGACCAATTCGT?J^CCGATCTCTTACCGCGACTC 



L M 
1 1 

ATTCAGAGAACAGGGCTGGAGGTAAAACCATTTATTACTAACCCCAGAGCAGTGAAGAAG 

TAAGTCTCTTGTCCCGACCTCCATTTTGGTAAATAATGATTGGGGTCTCGTCACTTCTTC 

igrtglevkpf itnpravkk 




/ its 



ySEQ V5.33 HASP. SEQ(1, 1277) Reading frames: i Enzyrae file ALL.ENZ LinPage 2 

CR - M ■ M TH 

SS B A FN 

PA O E IF ■ ■ 

61 2 3 ■ 11 • 

/ 

TGTACCAGATATATTGACTGTGACCTGAATCGCATTTTTGACCTTGAAAATCTTGGCAAA 

- + • + + + + . + 240 

ACATGGTCTATATAACTGACACTGGACTTAGCGTAAAAACTGGAACTTTTAGAACCGTTT 

ctryidcdlnrifdlenlQk 



NM BN 

DB AS 

EO NP 

12 22 

AAAATGTCAGAAGATTTGCCATATGAAGTGAGAAGGGCTCAAGAAATAAATCATTTATTT 

• + . + + . + . + . + 300 

TTTTACAGTCTTCTAAACGGTATACTTCACTCTTCCCGAGTTCTTTATTTAGTAAATAAA 

kmsedlpyevrraqeinhlf 



A TH M S 

V FN B P 

A IF O O 

2 11 2 1 

/ 

GGTCCAAAAGACAGTGAAGATTCCTATGACATTATTTTTGACCTTCACAACACCACCTCT 

- + • + . + . + . + . + 360 

CCAGGTTTTCTGTCACTTCTAAGGATACTGTAATAAAAACTGGAAGTGTTGTGGTGGAGA 

gpk-dsedsydiifdlhntts 



V5.33 HASP. SEQ(1, 1277) Reading frames: ■ l Enzyme file ALL.ENZ LinPage 3 

MN A HBN M TH E AS Tl/ 

NL P GSS N " FN .C PC RS 

LA L IIP L . • IF R YR UE 

13 1 AH2 1 11 2 11 91 

III II J/ 

AACATGGGGTGCACTCTTATTCTTGAGGATTCCAGGAATAACTTTTTAATTCAGATGTTT 

. + . + . -+ . + . 4- . + 420 

TTGTACCCCACGTGAGAATAAGAACTCCTAAGGTCCTTATTGAAAAATTAAGTCTACAAA 

nmgctliledsrnnfl iqmf 



TM N M F E 



RS 



C 



UE A E K 

91 4 2 1 B 

/ 

CATTACATTAAGACTTCTCTGGCTCCACTACCCTGCTACGTTTATCTGATTGAGCATCCT 

- + . + , + . + . + . + 480 

GTAATGTAATTCTGAAGAGACCGAGGTGATGGGACGATGCAAATAGACTAACTCGTAGGA 



hyiktslaplpcyvyl 



P 



A L 
N 1 



TCCCTCAAATATGCGACCACTCGTTCCATAGCCAAGTATCCTGTGGGTATAGAAGTTGGT 
ACrGGAGTTTATACGCTGGTGAGCAAGGTATCGGTTCATAGGACACCCATATCTTCAACCA 



540 



Wsm V5.3 3 HASP -SEQ( 1,1277) Reading frames: l -Enzyme file^ALL.ENZ LxnPage 4 

D M M D A E BMDD ■ , 

D' N N D L C IBPP RS ('^^ 

• E • L . . L E U R NONN ' UE 

1 ■ 1 11 IV 1121 91 

CCTCAGCCTCAAGGGGTTCTGAGAGCTGATATCTTGGATCAAATGAGAAAAATGATTAAA 

• + • + + . + . + . + 600 

GGAGTCGGAGTTCCCCAAGACTCTCGACTATAGAACCTAGTTTACTCTTTTTACTAATTT 

pqpqgvlradildqmrkmik 



NN HMHM 
SL 
PA 



INHN 



NLAL 

^-^ Pill 
/ / 
CATGCTCTTGATTTTATACATCATTTCAATGAAGGAAAAGAATTTCCTCCCTGCGCCATT 

- + . + . + . + , h 

GTACGAGAACTAAAATATGTAGTAAAGTTACTTCCTTTTCTTAAAGGAGGGACGCGGTAA 

haldfihhfnegkefppcai 



E BSBNXSASSBBHNSB FF F IF 

*^ SESCMMVCESSPCCB OO O TN 

P 'T'^^r ACAIAAARCAAAIRV KK K AU 

1 ' J1J111111JJ2111 11 1 IH 

GAGGTCTATAAAATTATAGAGAAAGTTGAfTACCCCCGGGATGAAA^ 

• + • + • + . H . + . + 

CTCCAGATATTTTAATATCTCTTTCAACTAATGGGGGCCCTACTTTTACCTCTTTAACGA 



V5:33 HASP. SEQC 1,1277) Reading frames: 1 Enzyme file ALL.ENZ LinPage 5 



s 


PBMDD 


F 




F 


SIBPP 


0 


CEPCSSFIHALBPPI 


C 


TNONN 


. K 


RCYRAAANOMAONNN 


1 


11121 


1 


2111JJN12141211 




nil 




/// llllllll 



GCTATCATCCATCCTAATCTGCAGGATCAAGACTGGAAACCACTGCATCCTGGGGATCCC 

^ + , + , + . + + 

CGATAGTAGGTAGGATTAGACGTCCTAGTTCTGACCTTTGGTGACGTAGGACCCCTAGGG 



v/ 
TM 
RS 
UE 
91 
/ 



MDBBBDBMA . BBAB 
BPBSPPIBL SSCS 
ONSCUNNOW ILIM 
121911122 Y112 
/ //// /.// 



CR 
SS 
PA 
61 



CR 
SS 
PA 



atgtttttMctcttgatgggaagacgatcccactggg^ggagactgtaccgtgtacccc 

TACAAAAATTGAGAACTACCCTTCTGCTAGGGTGACCQGCCTCTGACATGGCACATGGGG 
mfltldgktiplggdctvyp 



SM HIFA H A 

PN ATNC I L 

OL EAUI N U 

11 31H1 3 1 

/ ^ // y- 

GTGTTTGTGAATGAGGCCGCATATTACGAAAAGAAAGAAGCTTTTGCAAAGACAACTAAA 

^ + ^ + . + , + + . + 900 

CACAAACACTTACTCCGGCGTATAATGCTTTTCTTTCTTCGAAAACGTTTCTGTTGATTT 



V5.3 3- HASP. SEQ (1, 1277) Reading frames: 1 Enzyme file ALL. ENZ LinPage 



S - B ■ HIF E ■ A 

P B. NTN C L. 

O V . FAU 1 ■ • ' U'-. 

. ' 1 1 ■ • 3 IH 5 -■ . • ■ 1 . 

^ / 

CTAACGCTCAATGCAAAAAGTATTCGCTGCTGTTTACATTAGAAATCACTTCCAGCTTAC 



GATTGCGAGTTACGTTTTTCATAAGCGACGACAAATGTAATCTTTAGTGAAGGTCGAATG 



RM A ATM 

MA L FRS 

AE U LUE 

11 1 291 

/ / 
ATCTTACACGGTGTCTTACAAATTCTGCTAGTCTGTAAGCTCCTTAAGAGTAGGGTTGTG 

. + . + . + . + . + . + 

TAGAATGTGCCACAGAATGTTTAAGACGATCAGACATTCGAGGAATTCTCATCCCAACAC 

i Ihgvlq i 1 Ivckl Iks rvv 



B 


A 


RM 


H 


D 


S 


S 


L 


MA 


N 


D 


F 


P 


U 


AE 


F 


E 


A 


W 


1 


11 
/ 


3 


1 


N 



CCTTATTCAACTGCATACATAGCTCCTAGCACAGTGCCTTATTCGGTAGGCATCTAAGCA 

. + . ■-+ . + . + . + . + 1080 

GGAATAAGTTGACGTATGTATCGAGGATCGTGTCACGGAATAAGCCATCCGTAGATTCGT 



V5.33 HASP. SEQ(1, 1277) Reading frames: 1 Enzyme file ALL.ENZ LinPage 7 

J y l/ J 

TM^ ATM PATH TDM E A 

RS SRS ASRS RRS C L 

UE EUE CEUE UAE R U 

91 191 1191 911 V 1 

/ // /// / 

AATTTCTTAAATTAATTAATATATCTTTAAAGATATCATATTTTATGTATGTAGCTTATT 

TTAAAGAATTTAATTAATTATATAGAAATTTCTATAGTATAAAATACATACATCGAATAA 



X N A 

M L L 

N A U 

1 3 1 

CAAAGAAGTGTTTCCTATTTCTATATAGTTTATTATACATGATACTTGGGTAGCTCAACA 

GTTTCTTCACAAAGGATAAAGATATATCAAATAATATGTACTATGAACCCATCGAGTTGT 



TM TM 
RS RS 
UE UE 
91 91 
/ / 
TTCTTAATAAACAGCCTTTGTATTCAGAATATAAAATTGAAATAGATATATATAAAGTTA 

AAGAATTATTTGTCGGAAACATAAGTCTTATATTTTAACTTTATCTATATATATTTCAAT 



nslcianikl 



1200 



AAAAAAAAAAAAAAAAA 

. + . — 1277 

k k k k k k 



vSEQ V5.33 HASP . SEQ (1, 1277) Reading frames: 



Enzyme file ALL. EN: 



S B HIF E A 

P B NTN C L 

0 V FAU 1 U 

1 1 3 IK 5 1 

^ / 

CTAACGCTCAATGCAAAAAGTATTCGCTGCTGTTTACATTAGAAATCACTTCCAGCTTAC I 



GATTGCGAGTTACGTTTTTCATAAGCGACGACAAATGTAATCTTTAGTGAAGGTCGAATG 
Itlnaksircclh. 



RM A ATM • ' ' 

MA L FRS 

AE U LUE 

11 1 291 

/ / 
ATCTTACACGGTGTCTTACAAATTCTGCTAGTCTGTAAGCTCCTTAAGAGTAGGGTTGTG 

TAGAATGTGCCACAGAATGTTT4ACIACGA5-C^AGA.CATT 



,B A _ RM H D S 

'S^ 'J. MJ\ N D _ F 

P U AE F E A 

W 1 11 3 - - , 1 N - 

/ 

CCTTATTCAACTGCATACATAGCTCCTAGCACAGTGCCTTATTCGGTAGGCATCTAAGCA 
GGAATAAGTTGACGTATGTATCGAGGATCGTGTCACGGAATAAGCCATCCGTAGATTCGT 



■?SEQ V5.33 HASP. SEQ(1, 1277) Reading frames: 1 Enzvme file ALL.ENZ LinPa^< 

J - y 1/ -y ' ^ 

TM-^ ATM PATM TDM E A 

RS SRS ASRS RRS C L 

UE EUE CEUE UAE R U 

91 191 1191 911 V 1 

/ // /// / 
AATTTCTTAAATTAATTAATATATCTTTAAAGATATCATATTTTATGTATGTAGCTTATT 



TTAAAGAATTTAATTAATTATATAGAAATTTCTATAGTATAA.AA.TACATACATCGAAT.AA 



CAA^AGAAGTGTTTCCTATTTCTATATAGTTTATTATACAT.GATACTTr.&GmGCTCAACA 
GTTTCTTCAGAAAGGATAAAGAT AT ATCAAATAAaV#5 ^2 0 0 



TM 
RS 
UE 
91 



TM 
RS 



91 



/ 
FT 

AAGAATTATTTGTCGGAAACATAAGTCTTATATTTTAACTTTATCTATATATATTTCAA' 



TTCTTAATAAACAGCCTTTGTATTCAGAATATAAAATTGAAATAGATATATATAAAGTTA 



AAAAAAAAAAAAAAA-AA 

. + .-- 1277 

TTTTTTTTTTTTTTTTT 



